


SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Eaton, Dan L.

de Sauvage, Frederic J.

(ii) TITLE OF INVENTION: MPL LIGAND

(iii) NUMBER OF SEQUENCES: 21

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 460 Point San Bruno Blvd 
(C) CITY: South San Francisco

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94080 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: WinPatin (Genentech) 

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: 08/430035

(B) FILING DATE: 27-Apr-1995

(C) CLASSIFICATION:

(B) FILING DATE: 21-JAN-1994
(A) APPLICATION NUMBER: 08/176553
(A) NAME: Winter, Daryl B.
(B) REGISTRATION NUMBER: 32,637

(C) REFERENCE /DOCKET NUMBER: P0871P2D2 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 415/225-1249 
(B) TELEFAX: 415/952-9881

(C) TELEX: 910/371-7168 


(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala Arg Leu Thr Leu
-16 -15 -10 -5

Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys
1 5 10

Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu
15 20 25 26

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:



GAATTCCTGG AATACCAGCT GACAATGATT TCCTCCTCAT CTTTCAACCT 50 

CACCTCTCCT CATCTAAGAA TTG CTC CTC GTG GTC ATG CTT 91 

Leu Leu Leu Val Val Met Leu 
-16 -15 -10 

CTC CTA ACT GCA AGG CTA ACG CTG TCC AGC CCG GCT CCT 130 
Leu Leu Thr Ala Arg Leu Thr Leu Ser Ser Pro Ala Pro 

-5 1 

CCT GCT TGT GAC CTC CGA GTC CTC AGT AAA CTG CTT CGT 169 
Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu Leu Arg 
5 10 15 

GAC TCC CAT GTC CTT CAC AGC AGA CTG GTGA GAACTCCCAA 210 
Asp Ser His Val Leu His Ser Arg Leu 
20 25 26 

CATTATCCCC TTTATCCGCG TAACTGGTAA GACACCCATA CTCCCAGGAA 260 

GACACCATCA CTTCCTCTAA CTCCTTGACC CAATGACTAT TCTTCCCATA 310 

TTGTCCCCAC CTACTGATCA CACTCTCTGA CAAGAATTAT TCTTCACAAT 360 

ACAGCCCGCA TTTAAAAGCT CTCGTCTAGA 390
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



TCTAGACGAG AGCTTTTAAA TGCGGGCTGT ATTGTGAAGA ATAATTCTTG 50
TCAGAGAGTG TGATCAGTAG GTGGGGACAA TATGGGAAGA ATAGTCATTG 100
GGTCAAGGAG TTAGAGGAAG TGATGGTGTC TTCCTGGGAG TATGGGTGTC 150
TTACCAGTTA CGCGGATAAA GGGGATAATG TTGGGAGTTC TCACCAGTCT 200
GCTGTGAAGG ACATGGGAGT CACGAAGCAG TTTACTGAGG ACTCGGAGGT 250
CACAAGCAGG AGGAGCCGGG CTGGACAGCG TTAGCCTTGC AGTTAGGAGA 300
AGCATGACCA CGAGGAGCAA TTCTTAGATG AGGAGAGGTG AGGTTGAAAG 350
ATGAGGAGGA AATCATTGTC AGCTGGTATT CCAGGAATTC 390
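
The listing does not say how SEQ ID NO:3 relates to SEQ ID NO:2, but the terminal decamers of the two entries suggest that one is the reverse complement of the other. A minimal Python sketch of that check follows; the helper name `reverse_complement` and the choice to compare only the first and last ten bases are illustrative assumptions, not part of the listing.

```python
# Illustrative sketch only: `reverse_complement` is a hypothetical helper,
# not something defined by the sequence listing.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# Terminal decamers copied from the two entries above.
seq2_first10 = "GAATTCCTGG"  # SEQ ID NO:2, bases 1-10
seq2_last10 = "CTCGTCTAGA"   # SEQ ID NO:2, bases 381-390
seq3_first10 = "TCTAGACGAG"  # SEQ ID NO:3, bases 1-10
seq3_last10 = "CCAGGAATTC"   # SEQ ID NO:3, bases 381-390

# Both comparisons should hold if SEQ ID NO:3 is the opposite strand.
print(reverse_complement(seq2_last10) == seq3_first10)
print(reverse_complement(seq2_first10) == seq3_last10)
```

Comparing the full 390-base strings in the same way would be the stricter test; only the ends are shown here to keep the example short.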
(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr
-21 -20 -15 -10

Ala Arg Leu Thr Leu Ser Ser Pro Ala Pro Pro Ala Cys Asp Leu
-5 1 5

Arg Val Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu His Ser
10 15 20

Arg Leu Ser Gln Cys Pro Glu Val His Pro Leu Pro Thr Pro Val
25 30 35

Leu Leu Pro Ala Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gln
40 45 50

Met Glu Glu Thr Lys Ala Gln Asp Ile Leu Gly Ala Val Thr Leu
55 60 65

Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gln Leu Gly Pro Thr
70 75 80

Cys Leu Ser Ser Leu Leu Gly Gln Leu Ser Gly Gln Val Arg Leu
85 90 95

Leu Leu Gly Ala Leu Gln Ser Leu Leu Gly Thr Gln Leu Pro Pro
100 105 110

Gln Gly Arg Thr Thr Ala His Lys Asp Pro Asn Ala Ile Phe Leu
115 120 125

Ser Phe Gln His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu
130 135 140

Val Gly Gly Ser Thr Leu Cys Val Arg Arg Ala Pro Pro Thr Thr
145 150 155

Ala Val Pro Ser Arg Thr Ser Leu Val Leu Thr Leu Asn Glu Leu
160 165 170

Pro Asn Arg Thr Ser Gly Leu Leu Glu Thr Asn Phe Thr Ala Ser
175 180 185

Ala Arg Thr Thr Gly Ser Gly Leu Leu Lys Trp Gln Gln Gly Phe
190 195 200

Arg Ala Lys Ile Pro Gly Leu Leu Asn Gln Thr Ser Arg Ser Leu
205 210 215

Asp Gln Ile Pro Gly Tyr Leu Asn Arg Ile His Glu Leu Leu Asn
220 225 230

Gly Thr Arg Gly Leu Phe Pro Gly Pro Ser Arg Arg Thr Leu Gly
235 240 245

Ala Pro Asp Ile Ser Ser Gly Thr Ser Asp Thr Gly Ser Leu Pro
250 255 260

Pro Asn Leu Gln Pro Gly Tyr Ser Pro Ser Pro Thr His Pro Pro
265 270 275

Thr Gly Gln Tyr Thr Leu Phe Pro Leu Pro Pro Thr Leu Pro Thr
280 285 290

Pro Val Val Gln Leu His Pro Leu Leu Pro Asp Pro Ser Ala Pro
295 300 305

Thr Pro Thr Pro Thr Ser Pro Leu Leu Asn Thr Ser Tyr Thr His
310 315 320

Ser Gln Asn Leu Ser Gln Glu Gly
325 330 332

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1798 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:



GCGTCTTCCT ACCCATCTGC TCCCCAGAGG GCTGCCTGCT GTGCACTTGG 50

GTCCTGGAGC CCTTCTCCAC CCGGATAGAT TCCTCACCCT TGGCCCGCCT 100

TTGCCCCACC CTACTCTGCC CAGAAGTGCA AGAGCCTAAG CCGCCTCCAT 150

GGCCCCAGGA AGGATTCAGG GGAGAGGCCC CAAACAGGGA GCCACGCCAG 200

CCAGACACCC CGGCCAGA ATG GAG CTG ACT GAA TTG CTC CTC 242
Met Glu Leu Thr Glu Leu Leu Leu
-21 -20 -15

GTG GTC ATG CTT CTC CTA ACT GCA AGG CTA ACG CTG TCC 281
Val Val Met Leu Leu Leu Thr Ala Arg Leu Thr Leu Ser
-10 -5

AGC CCG GCT CCT CCT GCT TGT GAC CTC CGA GTC CTC AGT 320
Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser
1 5 10

AAA CTG CTT CGT GAC TCC CAT GTC CTT CAC AGC AGA CTG 359
Lys Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu
15 20 25

AGC CAG TGC CCA GAG GTT CAC CCT TTG CCT ACA CCT GTC 398
Ser Gln Cys Pro Glu Val His Pro Leu Pro Thr Pro Val
30 35

CTG CTG CCT GCT GTG GAC TTT AGC TTG GGA GAA TGG AAA 437
Leu Leu Pro Ala Val Asp Phe Ser Leu Gly Glu Trp Lys
40 45 50

ACC CAG ATG GAG GAG ACC AAG GCA CAG GAC ATT CTG GGA 476
Thr Gln Met Glu Glu Thr Lys Ala Gln Asp Ile Leu Gly
55 60 65

GCA GTG ACC CTT CTG CTG GAG GGA GTG ATG GCA GCA CGG 515
Ala Val Thr Leu Leu Leu Glu Gly Val Met Ala Ala Arg
70 75

GGA CAA CTG GGA CCC ACT TGC CTC TCA TCC CTC CTG GGG 554
Gly Gln Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu Gly
80 85 90

CAG CTT TCT GGA CAG GTC CGT CTC CTC CTT GGG GCC CTG 593
Gln Leu Ser Gly Gln Val Arg Leu Leu Leu Gly Ala Leu
95 100

CAG AGC CTC CTT GGA ACC CAG CTT CCT CCA CAG GGC AGG 632
Gln Ser Leu Leu Gly Thr Gln Leu Pro Pro Gln Gly Arg
105 110 115

ACC ACA GCT CAC AAG GAT CCC AAT GCC ATC TTC CTG AGC 671
Thr Thr Ala His Lys Asp Pro Asn Ala Ile Phe Leu Ser
120 125 130

TTC CAA CAC CTG CTC CGA GGA AAG GTG CGT TTC CTG ATG 710
Phe Gln His Leu Leu Arg Gly Lys Val Arg Phe Leu Met
135 140

CTT GTA GGA GGG TCC ACC CTC TGC GTC AGG CGG GCC CCA 749
Leu Val Gly Gly Ser Thr Leu Cys Val Arg Arg Ala Pro
145 150 155

CCC ACC ACA GCT GTC CCC AGC AGA ACC TCT CTA GTC CTC 788
Pro Thr Thr Ala Val Pro Ser Arg Thr Ser Leu Val Leu
160 165

ACA CTG AAC GAG CTC CCA AAC AGG ACT TCT GGA TTG TTG 827
Thr Leu Asn Glu Leu Pro Asn Arg Thr Ser Gly Leu Leu
170 175 180

GAG ACA AAC TTC ACT GCC TCA GCC AGA ACT ACT GGC TCT 866
Glu Thr Asn Phe Thr Ala Ser Ala Arg Thr Thr Gly Ser
185 190 195

GGG CTT CTG AAG TGG CAG CAG GGA TTC AGA GCC AAG ATT 905
Gly Leu Leu Lys Trp Gln Gln Gly Phe Arg Ala Lys Ile
200 205

CCT GGT CTG CTG AAC CAA ACC TCC AGG TCC CTG GAC CAA 944
Pro Gly Leu Leu Asn Gln Thr Ser Arg Ser Leu Asp Gln
210 215 220

ATC CCC GGA TAC CTG AAC AGG ATA CAC GAA CTC TTG AAT 983
Ile Pro Gly Tyr Leu Asn Arg Ile His Glu Leu Leu Asn
225 230

GGA ACT CGT GGA CTC TTT CCT GGA CCC TCA CGC AGG ACC 1022
Gly Thr Arg Gly Leu Phe Pro Gly Pro Ser Arg Arg Thr
235 240 245

CTA GGA GCC CCG GAC ATT TCC TCA GGA ACA TCA GAC ACA 1061
Leu Gly Ala Pro Asp Ile Ser Ser Gly Thr Ser Asp Thr
250 255 260

GGC TCC CTG CCA CCC AAC CTC CAG CCT GGA TAT TCT CCT 1100
Gly Ser Leu Pro Pro Asn Leu Gln Pro Gly Tyr Ser Pro
265 270

TCC CCA ACC CAT CCT CCT ACT GGA CAG TAT ACG CTC TTC 1139
Ser Pro Thr His Pro Pro Thr Gly Gln Tyr Thr Leu Phe
275 280 285

CCT CTT CCA CCC ACC TTG CCC ACC CCT GTG GTC CAG CTC 1178
Pro Leu Pro Pro Thr Leu Pro Thr Pro Val Val Gln Leu
290 295

CAC CCC CTG CTT CCT GAC CCT TCT GCT CCA ACG CCC ACC 1217
His Pro Leu Leu Pro Asp Pro Ser Ala Pro Thr Pro Thr
300 305 310

CCT ACC AGC CCT CTT CTA AAC ACA TCC TAC ACC CAC TCC 1256
Pro Thr Ser Pro Leu Leu Asn Thr Ser Tyr Thr His Ser
315 320 325

CAG AAT CTG TCT CAG GAA GGG TAA GGTTCTCAGA CACTGCCGAC 1300
Gln Asn Leu Ser Gln Glu Gly
330 332

ATCAGCATTG TCTCATGTAC AGCTCCCTTC CCTGCAGGGC GCCCCTGGGA 1350 
GACAACTGGA CAAGATTTCC TACTTTCTCC TGAAACCCAA AGCCCTGGTA 1400 
AAAGGGATAC ACAGGACTGA AAAGGGAATC ATTTTTCACT GTACATTATA 1450 
AACCTTCAGA AGCTATTTTT TTAAGCTATC AGCAATACTC ATCAGAGCAG 1500 
CTAGCTCTTT GGTCTATTTT CTGCAGAAAT TTGCAACTCA CTGATTCTCT 1550 
ACATGCTCTT TTTCTGTGAT AACTCTGCAA AGGCCTGGGC TGGCCTGGCA 1600 
GTTGAACAGA GGGAGAGACT AACCTTGAGT CAGAAAACAG AGAAAGGGTA 1650 
ATTTCCTTTG CTTCAAATTC AAGGCCTTCC AACGCCCCCA TCCCCTTTAC 1700 
TATCATTCTC AGTGGGACTC TGATCCCATA TTCTTAACAG ATCTTTACTC 1750 
TTGAGAAATG AATAAGCTTT CTCTCAGAAA AAAAAAAAAA AAAAAAAA 1798 
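
The annotated region of SEQ ID NO:5 pairs each triplet with a three-letter residue code. As a rough sketch, and assuming the standard genetic code (the listing does not name a translation table), the first and last annotated codon runs can be translated in Python as follows. The names `CODON_TABLE` and `translate` are illustrative, and one-letter residue codes are used for brevity.

```python
from itertools import product

# Standard genetic code laid out in TCAG order; '*' marks a stop codon.
# Using the standard table is an assumption, not something the listing states.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate complete codons of `dna`, reading from its first base."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_codons = "ATGGAGCTGACTGAATTGCTCCTC"  # SEQ ID NO:5, bases 219-242
last_codons = "CAGAATCTGTCTCAGGAAGGGTAA"   # SEQ ID NO:5, bases 1257-1280

print(translate(first_codons))  # Met Glu Leu Thr Glu Leu Leu Leu in one-letter form
print(translate(last_codons))   # Gln Asn Leu Ser Gln Glu Gly followed by the stop
```

The second call ends on the TAA triplet that follows the final Gly codon, which is consistent with the annotation stopping at residue 332.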
(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1798 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



TTTTTTTTTT TTTTTTTTTT TCTGAGAGAA AGCTTATTCA TTTCTCAAGA 50
GTAAAGATCT GTTAAGAATA TGGGATCAGA GTCCCACTGA GAATGATAGT 100
AAAGGGGATG GGGGCGTTGG AAGGCCTTGA ATTTGAAGCA AAGGAAATTA 150
CCCTTTCTCT GTTTTCTGAC TCAAGGTTAG TCTCTCCCTC TGTTCAACTG 200
CCAGGCCAGC CCAGGCCTTT GCAGAGTTAT CACAGAAAAA GAGCATGTAG 250
AGAATCAGTG AGTTGCAAAT TTCTGCAGAA AATAGACCAA AGAGCTAGCT 300
GCTCTGATGA GTATTGCTGA TAGCTTAAAA AAATAGCTTC TGAAGGTTTA 350
TAATGTACAG TGAAAAATGA TTCCCTTTTC AGTCCTGTGT ATCCCTTTTA 400
CCAGGGCTTT GGGTTTCAGG AGAAAGTAGG AAATCTTGTC CAGTTGTCTC 450
CCAGGGGCGC CCTGCAGGGA AGGGAGCTGT ACATGAGACA ATGCTGATGT 500
CGGCAGTGTC TGAGAACCTT ACCCTTCCTG AGACAGATTC TGGGAGTGGG 550
TGTAGGATGT GTTTAGAAGA GGGCTGGTAG GGGTGGGCGT TGGAGCAGAA 600
GGGTCAGGAA GCAGGGGGTG GAGCTGGACC ACAGGGGTGG GCAAGGTGGG 650
TGGAAGAGGG AAGAGCGTAT ACTGTCCAGT AGGAGGATGG GTTGGGGAAG 700
GAGAATATCC AGGCTGGAGG TTGGGTGGCA GGGAGCCTGT GTCTGATGTT 750
CCTGAGGAAA TGTCCGGGGC TCCTAGGGTC CTGCGTGAGG GTCCAGGAAA 800
GAGTCCACGA GTTCCATTCA AGAGTTCGTG TATCCTGTTC AGGTATCCGG 850
GGATTTGGTC CAGGGACCTG GAGGTTTGGT TCAGCAGACC AGGAATCTTG 900
GCTCTGAATC CCTGCTGCCA CTTCAGAAGC CCAGAGCCAG TAGTTCTGGC 950
TGAGGCAGTG AAGTTTGTCT CCAACAATCC AGAAGTCCTG TTTGGGAGCT 1000
CGTTCAGTGT GAGGACTAGA GAGGTTCTGC TGGGGACAGC TGTGGTGGGT 1050
GGGGCCCGCC TGACGCAGAG GGTGGACCCT CCTACAAGCA TCAGGAAACG 1100
CACCTTTCCT CGGAGCAGGT GTTGGAAGCT CAGGAAGATG GCATTGGGAT 1150
CCTTGTGAGC TGTGGTCCTG CCCTGTGGAG GAAGCTGGGT TCCAAGGAGG 1200
CTCTGCAGGG CCCCAAGGAG GAGACGGACC TGTCCAGAAA GCTGCCCCAG 1250
GAGGGATGAG AGGCAAGTGG GTCCCAGTTG TCCCCGTGCT GCCATCACTC 1300
CCTCCAGCAG AAGGGTCACT GCTCCCAGAA TGTCCTGTGC CTTGGTCTCC 1350
TCCATCTGGG TTTTCCATTC TCCCAAGCTA AAGTCCACAG CAGGCAGCAG 1400
GACAGGTGTA GGCAAAGGGT GAACCTCTGG GCACTGGCTC AGTCTGCTGT 1450
GAAGGACATG GGAGTCACGA AGCAGTTTAC TGAGGACTCG GAGGTCACAA 1500
GCAGGAGGAG CCGGGCTGGA CAGCGTTAGC CTTGCAGTTA GGAGAAGCAT 1550
GACCACGAGG AGCAATTCAG TCAGCTCCAT TCTGGCCGGG GTGTCTGGCT 1600
GGCGTGGCTC CCTGTTTGGG GCCTCTCCCC TGAATCCTTC CTGGGGCCAT 1650
GGAGGCGGCT TAGGCTCTTG CACTTCTGGG CAGAGTAGGG TGGGGCAAAG 1700
GCGGGCCAAG GGTGAGGAAT CTATCCGGGT GGAGAAGGGC TCCAGGACCC 1750
AAGTGCACAG CAGGCAGCCC TCTGGGGAGC AGATGGGTAG GAAGACGC 1798



(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

Met Gly Val His Glu Cys Pro Ala Trp Leu Trp Leu Leu Leu Ser
1 5 10 15

Leu Leu Ser Leu Pro Leu Gly Leu Pro Val Leu Gly Ala Pro Pro
20 25 30

Arg Leu Ile Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu Leu Glu
35 40 45

Ala Lys Glu Ala Glu Asn Ile Thr Thr Gly Cys Ala Glu His Cys
50 55 60

Ser Leu Asn Glu Asn Ile Thr Val Pro Asp Thr Lys Val Asn Phe
65 70 75

Tyr Ala Trp Lys Arg Met Glu Val Gly Gln Gln Ala Val Glu Val
80 85 90

Trp Gln Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gln
95 100 105

Ala Leu Leu Val Asn Ser Ser Gln Pro Trp Glu Pro Leu Gln Leu
110 115 120

His Val Asp Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Thr Leu
125 130 135

Leu Arg Ala Leu Gly Ala Gln Lys Glu Ala Ile Ser Pro Pro Asp
140 145 150

Ala Ala Ser Ala Ala Pro Leu Arg Thr Ile Thr Ala Asp Thr Phe
155 160 165

Arg Lys Leu Phe Arg Val Tyr Ser Asn Phe Leu Arg Gly Lys Leu
170 175 180

Lys Leu Tyr Thr Gly Glu Ala Cys Arg Thr Gly Asp Arg
185 190 193

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

Ser Pro Ala Pro Pro Ala Cys Asp Pro Arg Leu Leu Asn Lys Leu
1 5 10 15

Leu Arg Asp Asp His Val Leu His Gly Arg
20 25

(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu
1 5 10 15

Leu Arg Asp Ser His Val Leu His Ser Arg Leu
20 25 26

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 amino acids

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

Ser Pro Ala Pro Pro Ala Cys Asp Pro Arg Leu Leu Asn Lys Leu 
1 5 10 15

Leu Arg Asp Asp Xaa Val Leu His Gly Arg Leu
20 25 26

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 amino acids

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Ser Pro Ala Pro Pro Ala Xaa Asp Pro Arg Leu Leu Asn Lys Leu
1 5 10 15

Leu Arg Asp Asp His Val Leu His Gly Arg
20 25

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 14 amino acids

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Xaa Pro Ala Pro Pro Ala Xaa Asp Pro Arg Leu Xaa Asn Lys 
1 5 10 14

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 45 base pairs 
(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 



GCCGTGAAGG ACGTGGTCGT CACGAAGCAG TTTATTTAGG AGTCG 45 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 



CCNGCNCCNC CNGCNTGYGA 20 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 



NCCRTGNARN ACRTGRTCRT C 21 
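
SEQ ID NO:14 and SEQ ID NO:15 contain IUPAC ambiguity symbols (N, Y and R), so each entry stands for a pool of related oligonucleotides, not a single molecule. The Python sketch below counts how many distinct sequences each entry covers. The helper name `degeneracy` is hypothetical, and the table lists the standard IUPAC symbols, of which only N, Y and R occur in these two entries.

```python
# Standard IUPAC nucleotide symbols mapped to the bases each one may represent.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(oligo: str) -> int:
    """Number of distinct unambiguous sequences covered by a degenerate oligo."""
    count = 1
    for symbol in oligo:
        count *= len(IUPAC[symbol])
    return count


seq14 = "CCNGCNCCNCCNGCNTGYGA"   # SEQ ID NO:14, spaces removed
seq15 = "NCCRTGNARNACRTGRTCRTC"  # SEQ ID NO:15, spaces removed

print(len(seq14), degeneracy(seq14))
print(len(seq15), degeneracy(seq15))
```

Multiplying the per-position choices works because each ambiguous position varies independently of the others.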
(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 69 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 



CCAGCGCCGC CAGCCTGTGA CCCCCGACTC CTAAATAAAC TGCCTCGTGA 50 
TGACCACGTT CAGCACGGC 69 
(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 69 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



CCAGCACCTC CGGCATGTGA CCCCCGACTC CTAAATAAAC TGCTTCGTGA 50 
CGACCACGTC CATCACGGC 69 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Pro Arg Leu Leu Asn Lys Leu Leu Arg 
1 5 9

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



CCAGCACCGC CGGCATGTGA CCCCCGACTC CTAAATAAAC TGCTTCGTGA 50 
CGATCATGTC TATCACGGT 69 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



GCTAGCTCTA GAAATTGCTC CTCGTGGTCA TGCTTCT 37 
(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:



CAGTCTGCCG TGAAGGACAT GG 22 